Modal Labs

mentions 1 type Organization feed RSS

// recent coverage 1 mentions

00:17

2026-06-20

modal.com

large-language-models

Speculation Is All You Need

Modal Labs released state-of-the-art DFlash speculators for Qwen 3.5 and Qwen 3.6 models on Hugging Face, achieving 5-20% additional speedups and enabling Qwen 3.5 122B-A10B to run at over 1000 tok/s …

// co-occurs with top 7 entities

Z Lab 1 Qwen 1 Hugging Face 1 SGLang 1 vLLM 1 B200 1 DFlash 1